Picture for Le Xu

Le Xu

Lodestar: An Online-Learning LLM Inference Router

Add code
May 31, 2026
Viaarxiv icon

A Triple-Modal Contrastive Learning Framework with Sequence, Graph, and 3D Features for Drug-Target Interaction Prediction

Add code
May 28, 2026
Viaarxiv icon

SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity

Add code
Mar 05, 2026
Viaarxiv icon

Covo-Audio Technical Report

Add code
Feb 10, 2026
Viaarxiv icon

OV-InstructTTS: Towards Open-Vocabulary Instruct Text-to-Speech

Add code
Jan 04, 2026
Viaarxiv icon

GaussianDWM: 3D Gaussian Driving World Model for Unified Scene Understanding and Multi-Modal Generation

Add code
Dec 29, 2025
Viaarxiv icon

OmniDrive-R1: Reinforcement-driven Interleaved Multi-modal Chain-of-Thought for Trustworthy Vision-Language Autonomous Driving

Add code
Dec 16, 2025
Figure 1 for OmniDrive-R1: Reinforcement-driven Interleaved Multi-modal Chain-of-Thought for Trustworthy Vision-Language Autonomous Driving
Figure 2 for OmniDrive-R1: Reinforcement-driven Interleaved Multi-modal Chain-of-Thought for Trustworthy Vision-Language Autonomous Driving
Figure 3 for OmniDrive-R1: Reinforcement-driven Interleaved Multi-modal Chain-of-Thought for Trustworthy Vision-Language Autonomous Driving
Figure 4 for OmniDrive-R1: Reinforcement-driven Interleaved Multi-modal Chain-of-Thought for Trustworthy Vision-Language Autonomous Driving
Viaarxiv icon

Perception Activator: An intuitive and portable framework for brain cognitive exploration

Add code
Jul 03, 2025
Viaarxiv icon

Mitigating Audiovisual Mismatch in Visual-Guide Audio Captioning

Add code
May 28, 2025
Viaarxiv icon

Hearing from Silence: Reasoning Audio Descriptions from Silent Videos via Vision-Language Model

Add code
May 19, 2025
Viaarxiv icon